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Abstract — In this work we propose an adaptive linear receiver 
structure based on interpolated finite impulse response (FIR) 
filters witti adaptive interpolators for direct sequence code divi- 
sion multiple access (DS-CDMA) systems in multipath channels. 
The interpolated minimum mean-squared error (MMSE) and the 
interpolated constrained minimum variance (CMV) solutions are 
described for a novel scheme where the interpolator is rendered 
time-varying in order to mitigate multiple access interference 
(MAI) and multiple-path propagation effects. Based upon the 
interpolated MMSE and CMV solutions we present compu- 
tationally efficient stochastic gradient (SG) and exponentially 
weighted recursive least squares type (RLS) algorithms for both 
receiver and interpolator filters in the supervised and blind 
modes of operation. A convergence analysis of the algorithms 
and a discussion of the convergence properties of the method are 
carried out for both modes of operation. Simulation experiments 
for a downlink scenario show that the proposed structures achieve 
a superior BER convergence and steady-state performance to 
previously reported reduced-rank receivers at lower complexity. 

Index Terms — DS-CDMA, multiuser detection, reduced-rank re- 
ceivers, adaptive algorithms. 



I. Introduction 

ADAPTIVE linear receivers E], Ell, O, H are a highly 
effective structure for combatting interference in DS- 
CDMA systems, since they usually show good performance 
and have simple adaptive implementation. The linear minimum 
mean-squared error (MMSE) receiver f3], ^ implemented 
with an adaptive filter is one of the most prominent design 
criteria for DS-CDMA systems. Such receiver only requires 
the timing of the desired user and a training sequence in 
order to suppress interference. Conversely, when a receiver 
loses track of the desired user and a training sequence is not 
available, a blind linear minimum variance (MV) receiver iQ, 
||6l that trades off the need for a training sequence in favor 
of the knowledge of the desired user's spreading code can be 
used to retrieve the desired signal. 

The works in |II]-|l6l were restricted to systems with short 
codes, where the spreading sequences are periodic. How- 
ever, adaptive techniques are also applicable to systems with 
long codes provided some modifications are carried out. The 
designer can resort to chip equalization (T] followed by a 
despreader for downlink scenarios. For an uplink solution, 
channel estimation algorithms for aperiodic sequences JS] , 19] 

This work was supported by the Brazihan Council for Scientific and Tech- 
nological Development (CNPq). Dr. R. C. de Lamare was with CETUC/PUC- 
RIO and is now a Lecturer with the Communications Research Group, 
Department of Electronics, University of York, York YOlO 5DD, United 
Kingdom and Prof. R. Sampaio-Neto is with CETUC/PUC-RIO, 22453-900, 
Rio de Janeiro, Brazil. Phone: +55-21-31141701 Fax: +55-21-22945748. E- 
mails: rcdl500@ohm.york.ac.uk, raimundo@cetuc.puc-rio.br 



are required and the sample average approach for estimating 
the covariance matrix R = E[r{i)r^ (i)] of the observed 
data r{i) has to be replaced by R = PP^ + ct^I, which is 
constructed with a matrix P containing the effective signature 
sequence of users and the variance of the receiver noise 
llOI . In addition, with some recent advances in random matrix 
theory flTl it is also possible to deploy techniques originally 
developed for systems with short codes in implementations 
with long codes. Furthermore, the adaptive receiver structures 
reported so far UJ-LiO] can be easily extended to asynchronous 
systems in uplink connections. In presence of large relative 
delays amongst the users, the observation window of each user 
should be expanded in order to consider an increased number 
of samples derived from the offsets amongst users. Alterna- 
tively for small relative delays amongst users, the designer can 
utilize chip oversampling to compensate for the random timing 
offsets. These remedies imply in augmented filter lengths and 
consequently increased computational complexity. 

In this context, a problem arises when the processing 
gain used in the system and the number of parameters for 
estimation is large. In these scenarios, the receiver has to 
cope with difficulties such as significant computational bur- 
den, increased amount of training and poor convergence and 
tracking performance. In general, when an adaptive filter with 
a large number of taps is used to suppress interference, then it 
implies slow response to changing interference and channel 
conditions. Reduced-rank interference suppression for DS- 
CDMA llT2ll - llT8l was originally motivated by situations where 
the number of elements in the receiver is large and it is 
desirable to work with fewer parameters for complexity and 
convergence reasons. Early works in reduced-rank interference 
suppression for DS-CDMA systems |12|, |13|, |14| were 
based on principal components (PC) of the covariance matrix 
R of the observation data. This requires a computationally 
expensive eigen-decomposition to extract the signal subspace 
which leads to poor performance in systems with moderate 
to heavy loads. An attempt to reduce the complexity of PC 
approaches was reported in lilSi with the partial despreading 
(PD) method, where the authors report a simple technique 
that allows the selection of the performance between the 
matched filter and the full-rank MMSE receiver. A promising 
reduced-rank technique for interference suppression, denoted 
multistage Wiener filter (MWF), was developed by Goldstein 
et al. in |17| and was later extended to SG and recursive 
adaptive versions by Honig and Goldstein in ifTSl . A problem 
with the MWF approach is that, although less complex than the 
full-rank solution, it still presents a considerable computational 
burden and numerical problems for implementation. In this 
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work, we present an alternative reduced-rank interference 
suppression scheme based on interpolated FIR filters with 
adaptive interpolators that gathers simplicity, great flexibility, 
low complexity and high performance. 

The interpolated FIR (IFIR) filter is a single rate structure 
that is mathematically related to signal decimation followed 
by filtering with a reduced number of elements ll20l . ET] . 
The basic idea is to exploit the coefficient redundancy in 
order to remove a number of impulse response samples, 
which are recreated using an interpolation scheme. The savings 
are obtained by interpolating the impulse response and by 
decimating the interpolated signal. This technique exhibits 
desirable properties, such as guaranteed stability, absence of 
limit cycles and low computational complexity. Thus, adap- 
tive IFIR (AIFIR) filters |22|,|23| represent an interesting 
alternative for substituting classical adaptive FIR filters. In 
some applications they show better convergence rate and can 
reduce the computational burden for filtering and coefficient 
updating, due to the reduced number of adaptive elements. 
These structures have been extensively applied in the context 
of digital filtering, although their use for parameter estimation 
in communications remains unexplored. 

Interference suppression with IFIR filters and time-varying 
interpolators with batch methods, which require matrix in- 
versions, were reported in ll24l . In this work, we investigate 
the suppression of MAI and intersymbol interference (ISI) 
with adaptive IFIR filters (that do not need matrix inver- 
sions) for both supervised and blind modes of operation in 
synchronous DS-CDMA systems with short codes. A novel 
AIFIR scheme where the interpolator is rendered adaptive is 
discussed and designed with both MMSE and MV design 
criteria. The new scheme, introduced in [25], |26|, yields a 
performance superior to conventional AIFIR schemes ||221 . 
||23 1 (where the interpolator is fixed) and a faster convergence 
performance than full-rank and other existing reduced-rank 
receivers. Computationally efficient stochastic gradient (SG) 
and recursive least squares (RLS) type adaptive algorithms are 
developed for the new structure based upon the MMSE and 
MV performance criteria with appropriate constraints to miti- 
gate MAI, ISI and jointly estimate the channel. The motivation 
for the novel structure is to exploit the redundancy found in 
DS-CDMA signals that operate in multipath, by removing a 
number of samples of the received signal and retrieving them 
through interpolation. The gains in convergence performance 
over full-rank solutions are obtained through the reduction of 
the number of parameters for estimation, leading to a faster 
acquisition of the required statistics of the method and a 
smaller misadjustment noise provided by the smaller filter 
lHj, \19\. Furthermore, the use of an adaptive interpolator 
can provide a time-varying and rapid means of compensation 
for the decimation process and the discarded samples. The 
novel scheme has the potential and flexibility to consistently 
yield faster convergence than the full-rank approach since 
the designer can choose the number of adaptive elements 
during the transient process and upon convergence increase the 
number of elements up to the full-rank. Unlike PC techniques, 
our scheme is very simple because it does not require eigen- 
decomposition and its performance is not severely degraded 
when the system load is increased. In contrast to PD, the adap- 



tive AIFIR structure jointly optimizes two filters, namely the 
interpolator and the reduced-rank, resulting in reduced-rank 
filters with fewer taps and faster convergence than PD since 
the interpolator helps with the compensation of the discarded 
samples. In comparison with the MWF the proposed scheme 
is simpler, more flexible and more suitable for implementation 
because the MWF has numerical problems in fixed point 
implementations . 

A convergence analysis of the algorithms and a discussion 
of the global convergence properties of the method, which 
are not treated in I24J - I.26J . are undertaken for both modes of 
operation. Specifically, we study the convergence properties of 
the proposed joint adaptive interpolator and receiver scheme 
and conclude that it leads to an optimization problem with 
multiple global minima and no local minima. In this regard and 
based on the analyzed convergence properties of the method, 
we show that the prediction of the excess mean square error 
(MSE) of both blind and supervised adaptive algorithms is 
rendered possible through the study of the MSE trajectory of 
only one of the jointly optimized parameter vectors, i.e. the 
interpolator or the reduced-rank filters. Then, using common 
assumptions of the adaptive filtering literature, such as the 
independence theory, we analyze the trajectory of the mean 
tap vector of the joint optimization of the interpolator and the 
receiver and MSE trajectory. We also provide some mathemati- 
cal conditions which explain why the new scheme with SG and 
RLS type algorithms is able to converge faster than the full- 
rank scheme. Although the novel structure and algorithms are 
examined in a synchronous downlink scenario with periodic 
signature sequences in this work, it should be remarked that 
they can be extended to long codes and asynchronous systems 
provided the designer adopts the modifications explained in 
the works reported in fTl-lTTI. 

This paper is organized as follows. Section II describes 
the DS-CDMA system model. The linear interpolated receiver 
principle and design criteria, namely the MMSE and con- 
strained MV (CMV) are described in Section III. Section 
IV is dedicated to the derivation of adaptive algorithms and 
Section V is devoted to the global convergence properties of 
the method and the convergence analysis of the algorithms. 
Section VI presents and discusses the simulation results and 
Section VII gives the concluding remarks. 

II. DS-CDMA SYSTEM MODEL 

Let us consider the downlink of a synchronous DS-CDMA 
system with K users, N chips per symbol and Lp propagation 
paths. The signal broadcasted by the base station intended for 
user k has a baseband representation given by: 

oo 

Xk{t)^Ak J2 bk{t)sk{t - iT) (1) 

i— — oo 

where bk{i) S {il} denotes the i-th symbol for user k, the 
real valued spreading waveform and the amplitude associated 
with user k are Sk{t) and A^, respectively. The spreading 
waveforms are expressed by Sk{t) = ^f^i ak{i)4>it ~ iTc), 
where akii) G {±1/ ^/N}, 4>{t) is the chip waveform, 
is the chip duration and TV = T/T^ is the processing gain. 
Assuming that the receiver is synchronized with the main path. 
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the coherently demodulated composite received signal is 
r{t) =Y^Y1 Ht)xk{t - Ti) + n{t) 



(2) 



fe=i 



where hi{t) and ti are, respectively, the channel coefficient 
and the delay associated with the l-th path. Assuming that 
Tk,i = ITc, the channel is constant during each symbol interval 
and the spreading codes are repeated from symbol to symbol, 
the received signal r{t) after filtering by a chip-pulse matched 
filter and sampled at chip rate yields the M = TV + Lp — 1 
dimensional received vector 



K 



r(i) = H(i) AkSkhkii) + n{i) 



(3) 



fe=i 



where n{i) = [ni{i) ... nMi)]^ is the complex Gaussian 
noise vector with E[n{i)n^ (i)] = cr^I, where (•)^ and (•)^ 
denotes transpose and Hermitian transpose, respectively, and 
E[.] is the expected value, the fc-th user symbol vector is 
hk{i) = [bk{i + Ls-l) ... bk{i) ... fefe(j - + 1)]^, where 
Lg is the ISI span and the ((2Ls — 1) x A^) x {2Ls — 1) matrix 
Sfc with non-overlapping shifted versions of the signature of 
user A; is 

Sfe ... 



Sfe 











Sfe 



(4) 



where the signature sequence for the fc-th user is = 
[afe(l) ... afe(iV)]^ and the M x ((2^^ - 1) x A^) channel 
matrix H(i) is 



H(z) 



ho{i) 
ho(i) 



hr 















ho{i) 



(5) 

where hi{i) = hi{iTc) for ^ = 0, . . . , Lp — 1. The MAI arises 
from the non-orthogonality between the received signals, 
whereas the ISI span Lg depends on the length of the chaimel 
response, which is related to the length of the chip sequence. 



For Lp = 1, Lg = 
N <Lp^ 2N, Ls 



1 (no ISI), for 
= 3 and so on. 



l<Lp^N,L, = 2, for 



111. Linear Interpolated CDMA receivers 

The underlying principles of the proposed CDMA receiver 
structure are detailed here. Fig. 1 shows the structure of 
an IFIR receiver, where an interpolator and a reduced-rank 
receiver that are time-vaiying are employed. The M x 1 

_ iJ.'i) Ji) IT 

'M-lJ 



-'kfi ■ 



''k,Ni 



is filtered by the 

j^]^, yielding the 



received vector r(i) = [tq 
interpolator filter Vfe(i) = 

interpolated received vector rfe(i). The vector rfe(z) is then 
projected onto an M/L x 1 vector ffe(i). This procedure 
corresponds to removing L — 1 samples of rfe(i) of each set 
of L consecutive ones. Then the inner product of ffe(i) with 
the M/L-dimensional vector of filter coefficients Wfe(i) = 
. . . 'i^fe M/L-i]^ computed to yield the output Xk{i). 
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Fig. 1. Proposed adaptive reduced-rank receiver structure. 



The projected interpolated observation vector Yk{i) = 
Drfc(?) is obtained with the aid of the M/L x M projection 
matrix D which is mathematically equivalent to signal deci- 
mation on the M X 1 vector rfe(i). An interpolated receiver 
with decimation factor L can be designed by choosing D as: 



D 
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{m—l)L zeros 











1 



{M/L—1)L zeros 



(L-1) 



(6) 

where m (m = 1, 2, . . . , Af/L) denotes the m-th row. The 
strategy, that allows us to devise solutions for both interpolator 
and receiver, is to express the estimated symbol Xfe(i) = 
wj^ {i)rk{i) as a function of Wfe(j) and Vfe(j) (we will drop 
the subscript k and symbol index (i) for ease of presentation): 

Xk{i) 



Wa^k ro +fi'iVfc ri 



w 



1 v?rM/L-i 



■li) 



M/L-1' 

w^(i) = v|^(z)K«w^(i) 
(7) 

where Ufe(z) = 3J(i)w^(i) is an A''/ x 1 vector, the M/L co- 
efficients of wj. (/) and the N; elements of Vfc(i) are assumed 
to be complex, the asterisk denotes complex conjugation and 
is{i) is a length Nj segment of the received vector v{i) 
begirming at rsxL{i) and 



(i) 



Ji) 
' L 



' (M/L- 



l)L 



(0 



' Ni-1 ' L+Ni-1 



' {M/L-1)L+1 



(j) 

''{M/L-l)L+Ni-l 



(8) 

The interpolated linear receiver design is equivalent to deter- 
mining an FIR filter Wfe(i) with M/L coefficients that provide 
an estimate of the desired symbol: 



^fc(*) = sgn[Re wf (i)rfe(i 



(9) 



where i?e(-) selects the real part, sgn{-) is the signum function 
and the receiver parameter vector Wfe is optimized according 
to a selected design criterion. 

A. MMSE Reduced-Rank Interpolated Receiver Design 

The MMSE solutions for Wfe(i) and Vfe(i) can be computed 
if we consider the optimization problem whose cost function 
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IS 



-R^i(i)DCfcA 



(10) 



where is the desired symbol for user k at time index 

(i). By fixing the interpolator Vfc(i) and minimizing (10) with 
respect to Wfe(i) the interpolated Wiener filter/receiver weight 
vector is 

Wfe(^) = a(vfc) = R-l(^)pfe(^) (11) 

where Rfc(^) = i;[rfc(i)ff (i)], pk{t) = i;[6^(i)rfc(i)], 
ffe(i) = K"^(i)v^(i) and by fixing viffc(i) and minimizing (10) 
with respect to Vk{i) the interpolator weight vector is 



(12) 



where R„^(i) = £'[ufe(i)uf (i)], p„Ji) £'[6^(i)ufc(i)] and 
u/j(i) = J?(i)w^.(i). The associated MSE expressions are 

J(vfe) = JA./S£(a(vfc),vfc) = a^-pf (i)R-i(i)pfc(i) (13) 

JMSB(wfe,/3(wfe)) = - pfj*)R-^i(j)p„,(*) (14) 

where cr^ = £'[|6(i)p]. Note that points of 
global minimum of (10) can be obtained by 
Vfc,opt = argminvj, J(vfe) and ^^k,opt = oc{vk,opt) 
or Wk,opf = argminw;, Jj\/S£;(wfc,/3(wfe)) and 
^k,opt = f3{'vvk,opt)- At the minimum point (13) equals 
(14) and the MMSE for the proposed structure is achieved. 
We remark that (11) and (12) are not closed-form solutions 
for Wfe(i) and Vfc(i) since (11) is a function of Vfe(i) and 
(12) depends on Wk{i) and thus it is necessary to iterate 
(11) and (12) with an initial guess to obtain a solution, as in 
Il24l . An iterative MMSE solution can be sought via adaptive 
algorithms. 

B. Constrained Minimum Variance ( CMV) Reduced-Rank In- 
terpolated Receiver Design 

The interpolated CMV receiver parameter vector and the 
interpolator parameter vector Vfe are obtained by minimizing 



^Afy(wfc, Vfc) = E \xk{i)\ 



= E 



= Wfc (i)RfeWfe(i) 



|vf(^)5^(^)w*(^)|^ 

vf(»)Ru,Vfc(z) 



(15) 

subject to the proposed constraints C^D^Wfc(i) = g(i) 
and ||vfc(i)|| = 1, where the AI x Lp constraint matrix Cfe 
contains one-chip shifted versions of the signature sequence 
of user k, g(i) is an ip-dimensional parameter vector to be 
determined. The vector of constraints g(i) can be chosen 
amongst various criteria although in this work we adopt 
g(i) as the channel parameter vector (g = [ho . . . h^^^i]^) 
because it provides better performance than other choices 
as reported in ||6]. The proposed constraint ||vfe(i)|| = 1 
ensures adequate design values for the interpolator filter v^, 
whereas C^D^Wfe(i) — g(i) avoids the suppression of the 
desired signal. By fixing v^, taking the gradient of the La- 
grangian function JMy(wfe,Vfe) = E [|vf (i)3ft(i)w^(i)| 



Re 



(Cf D^wfc(*) - g(*))^A 



where A is a vector of La- 



grange multipliers, with respect to Wfe and setting it to we 
get: 

£;[rfc(^)rf Wfc(z) + DCfeA = 



Using the constraint C^D^Wfc(j) = g(i) and substituting 
Wfc(i) = — Rj^^(j)DCfeA we arrive at 

A ^ -(Cf D^R^^(i)DCfc)-igfc(i). The resulting expres- 
sion for the receiver is 

Wfe(^) = ao(vfc) = Rfe(^)-lDCfe(Cf D^Rfc(z)-iDCfc)-ig(z) 

(16) 

and the associate minimum output variance is 



- g^W(Cf D^RfcW-iDCfc)-ig(z) 



(17) 



By fixing w^, the solution that minimizes (15) is: 

Vfc(i) /3o(wfc) = argmin v^RuJi)v (18) 

V 

subject to 1 1 Vfc (i) 1 1 = 1. Therefore, the solution for the interpo- 
lator is the normalized eigenvector of Ru^ corresponding to its 
minimum eigenvalue, via singular value decomposition (S VD). 
As occurs with the MMSE approach we iterate (16) and (18) 
with an initial guess to obtain a CMV solution 1, 24 J . Note also 
that (16) assumes the knowledge of the channel parameters. 
However, in applications where multipath is present these 
parameters are not known and thus channel estimation is 
required. To blindly estimate the channel we use the method 
of il, ED: 



= argmin g^Cf R-™(*)Cfeg 



(19) 



subject to ||g|| = 1, where R(i) = E[r{i)r^ (i)], m is a finite 
power and whose solution is the eigenvector corresponding to 
the minimum eigenvalue of the Lp x Lp matrix C^R(i)^™Cfc 
through SVD. Note that in this work we restrict the values of 
TO to 1 although the performance of the channel estimator and 
consequently of the receiver can be improved by increasing to. 
In the next section, we propose iterative solutions via adaptive 
algorithms. 



IV. Adaptive algorithms 

We describe SG and RLS algorithms JJTj (see Chapters 
9 and 13) that adjust the parameters of the receiver and the 
interpolator based on the MMSE criterion and the constrained 
minimization of the MV cost function ||25]| , ||26l . The novel 
structure, shown in Fig. 1 and denoted INT, for the receivers 
gathers fast convergence, low complexity and additional flexi- 
bility since the designer can adjust the decimation factor L and 
the length of the interpolator Nj depending on the needs of the 
application and the hostility of the environment. Based upon 
the MMSE and CMV design criteria, the proposed receiver 
structure has the following modes of operation: training mode, 
where it employs a known training sequence; decision directed 
mode, which uses past decisions in order to estimate the 
receiver parameters; and blind mode, which is based on the 
CMV criterion and trades-off the training sequence against the 
knowledge of the signature sequence. The complexity in terms 
of arithmetic operations of the algorithms associated with the 
INT and the existing techniques is included as a function of 
the number of adaptive elements for comparison purposes. 
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A. Least mean squares (LMS) algorithm 

Given the projected interpolated observation vector fk{i) 
and the desired symbol bk{i), we consider the following cost 
function: 



(20) 



Taking the gradient terms of (20) with respect to Wfe(i), Vfc(i) 
and using the gradient descent rules ||3T| (see Chapter 9, pp. 
367-371) Wfc(i + 1) = Wfe(i) - fj.Vi'i'^ and Vfc(i + 1) = 
Vfc(i) - ■qVii'^" yields: 

Vfc(i + 1) = Vfe(i) + ?7e^(j)ufe(i) (21) 

Wfe(« + 1) = Wfe(i) + ^iel{i)fk{i) (22) 

where efc(i) — bk{i) — Wfc(i)^rfc(i) is the error for user k, 
Ufe = 5i(i)wfc(i), /i and 77 are the step sizes of the algorithm 
for Wfc(i) and Vk{i). The LMS algorithm for the proposed 
structure described in this section has a computational com- 
plexity 0{M/ L+Nj). In fact, the proposed structure trades off 
one LMS algorithm with complexity 0{M) against two LMS 
algorithms with complexity 0(M/L) and 0{Ni), operating in 
parallel. It is worth noting that, for stability and to facilitate 
tuning of parameters, it is useful to employ normalized step 
sizes and consequently NLMS type recursions when operating 
in a changing environment and thus we have 11(1) = 
and ri{i) — ,., as the step sizes of the algorithm for 

Wfc(i) and v^fi), where fiQ and 779 are the convergence factors. 

B. Recursive least squares (RLS) algorithm 

Consider the time average estimate of the matrix R^, 

required in (11), given by Rfc(i) = Z^j^i a*^'ffc(Orf (0' 
where a (0 < a ^ 1) is the forgetting factor, that can be 
alternatively expressed by Rfe(i) = Q!Rfc(i — 1) + ffc(i)f^(i). 
To avoid the inversion of Rfe(i) required in (11), we use the 

matrix inversion lemma and define Pfc(i) = R^ (i) and the 
gain vector Gk{i) as: 

a-iPfc(i-l)ffc(i) 



Gfc(^) 



l + a-iff (i)Pfc(i- l)rfe(z) 



(23) 



and thus we can rewrite Pfe(i) as 

Pfc(z) = a-iPfe(z - 1) - a-iGfc(z)ff (^)Pfc(^ - 1) (24) 

By rearranging (23) we have Gfc(i) = a~^Pk{i — l)ffc(«) — 
a"^Gfc(«)rf (j)Pfc(i - l)ffc(«) = Pfc(i)ffc(j) . By employing 
the LS solution (a time average of (11)) and the recursion 

Pk{i) = apk{i - 1) + ^k{i)bl{i) we obtain 

- -1 

Wfc(j) ^ Rfe (i)pfc(i) = aPfc(«)pfc(i - 1) + Fk{i)rk{i)bl{i) 

(25) 

Substituting (24) into (25) yields: 



Wfe(i)=Wfc(i-l) + Gfc(z)^^(z) 



(26) 



where the a priori estimation error is described by ^kii) ~ 
bkii) — — l)rfe(i). Similar recursions for the interpolator 
are devised by using (12). The estimate R^. can be obtained 
through Rufc(*) = Si=i '^^feW^fc^lO ^^i^ can be alter- 
natively written as Rufc(j) — aR-ufc(* — 1) + Ufc(i)u^(i). 



To avoid the inversion of R^. we use the matrix inversion 
lemma and again for convenience of computation we define 
Pufc(*) — R-u^l*) and the Kalman gain vector Guj.(«) as: 



a ^Pufc(» - l)ufc(0 
l + a-iuf(z)P„,(j-l)ufc(*) 



(27) 



and thus we can rewrite (27) as 

P„,(z) = a-iPu,(z-l)-a-iG„,(z)uf (i)P„,(*-l) (28) 

By proceeding in a similar approach to the one taken to obtain 
(26) we arrive at 



VfcW=Vfe(z-l) + Gu,(z)4*« 



(29) 



The RLS algorithm for the proposed structure trades off a 
computational complexity of 0{AP) against two RLS algo- 
rithms operating in parallel, with complexity 0{{M / L)"^) and 
0{Nf), respectively. Because Ni is small (TV/ << M, as 
will be shown later) the computational advantage of the RLS 
combined with the INT structure is rather significant. 

C. Constrained minimum variance stochastic gradient ( CMV- 
SG) algorithm 

Consider the unconstrained Lagrangian MV cost function: 

Jmv - (vf (z)ufe(*)uf (z)v,(z)) + A^(Cf D«wfe(z) - g(z)) 
+ (wf(z)DC,-g^«)A 

(30) 

where A is a vector of Lagrange multipliers. An SG solution 
can be devised by taking the gradient terms of (30) with 
respect to Wfc(i) and Vfc(«) as described by Wfc(i + 1) = 
Wfc(i)-A^«)(«)VJwfc(i), Vfe(i + 1) = Vfe(i)-?7(i)VJv^(i) which 
adaptively minimizes Jmv with respect to Wfc(i) and Vfc(i). 
Substituting the gradient terms the equations become 

Wk{i + 1) = WfeCO - fi^{i){x*k{i)rk{i) + DCfeA(«)) (31) 

Vfc(i + 1) = Vfe(i) - r]{i)xl{i)uk{i) (32) 

where Xk{i) ~ w^(i)ffe(i) ~ v|^(i)ufc(i). We use (32) and 
can make Vk{i + 1) ^ Vk{i + l)/\\vk{i + 1)|| to update the 
interpolator v^,. It is worth noting that in our studies the nor- 
malization on SG algorithms does not lead to different results 
from the ones obtained with a non-normalized interpolator 
recursion. In this regard, analyzing the convergence of (32) 
without normalization is mathematically simpler and gives 
us the necessary insight into its convergence. By combining 
the constraint CfcD^Wfc(i) — g(i) and (32) we obtain the 
Lagrange multiplier 

A(z) = (Cf D^DCfc)-ix(C^Dwfc(i)-M»C^D4WffcW"g«) 

(33) 

By substituting (33) into (31) we arrive at the update rules for 
the estimation of the parameters of the receiver w^: 

wfe(*+l) = nfc(wfc(z)-M„(i)x:.(z)ffc(i))+DCfc(Cf D^DCfe)-ig(i) 

(34) 

where Hfe = I - DCfc(Cf D^DCfc)-iCf is a matrix 
that projects Wfe onto another hyperplane to ensure the con- 
straints. 

Normalized versions of these algorithms can be devised 
by substituting (32) and (34) into the MV cost function. 
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differentiating the cost function with respect to and 
IJ.v{i), setting them to zero and solving the new equations. 
Hence, the CMV-SG algorithm proposed here for the INT 
receiver structure adopts the normalized step sizes /itu(i) = 
-u,-,^ - /■\ and nii) = ,., where llq and rjo are the 
convergence factors for w/j and v^, respectively. 

The channel estimate g(i) is obtained through the power 
method and the SG technique described in [27 1. The method 
is an SG adaptive version of the blind channel estimation 
algorithm described in (19) and introduced in [28] that requires 
only 0{Lp) arithmetic operations to estimate the channel, 
against 0{Lp) of its SVD version. In terms of computational 
complexity, for the rejection of MAI and ISI, the proposed 
blind interpolated receiver trades off one blind algorithm 
with complexity 0{M) against two blind algorithms with 
complexity 0(il//L) and 0(Af/), operating in parallel. 

D. Constrained minimum variance recursive least squares 
(CMV-RLS) algorithm 

Based upon the expressions for the receiver Wfe and inter- 
polator Vfc in (16) and (18) of the interpolated CMV receiver, 
we develop a computationally efficient RLS algorithm for the 
INT structure that estimates the parameters of and v^. 

The iterative power method |.29J (see pp. 405 — 408), [30J (see 
pp. 314 — 333) is used in numerical analysis to compute the 
eigenvector corresponding to the largest singular value of a 
matrix. In order to obtain an estimate of and avoid the SVD 
on the estimate of the matrix Ruj. («) we resort to a variation of 
the iterative power method to obtain the eigenvector of (*) 
which corresponds to the minimum eigenvalue. 

Specifically, we apply the power method to the difference 
between Ru^ (i) and the identity matrix I, rather than applying 
it to the inverse of Ru^(i). This approach, known as shift 
iterations |30|(see pp. 319), leads to computational savings on 
one order of magnitude since direct SVD requires 0{Nf), 
while our approach needs 0{Nf). The simulations carried 
out reveal that this method exhibits no performance loss. 
Hence, we estimate Ru^(j) via the recursion R.u;,(i) = 
^^^Q a'~"ufc(n)u^(n) and then obtain the interpolator 
with a one step iteration given by: 



Vfc(i) = (I - z^fc(«)Rufc(j))vfc(j - 1) 



(35) 



where i^kii) — l/^''[R-ut.(i)]- After that, we make Vfc(i) 
Vfc(i)/||vfe(i)|| to normalize the interpolator This procedure 
is based on the following result. 

Lemma: Let R be a positive semi-definite Hermitian sym- 
metric matrix and qmin the eigenvector associated to the 
smallest eigenvalue. If qmm is unique and of unit norm, then 
with V ~ l/tr[H] the sequence of vectors v(i) = |^|^||| 
with v(i) = (I — i'(i)R)v(i — 1) converges to Qmin, provided 
that v(0) is not orthogonal to q„iiri- A proof is shown in the 
Appendix. 

To recursively estimate the matrix Rfc(i) and avoid its 
inversion we use the matrix inversion lemma and Kalman RLS 
recursions lISTI : 



G{^) 



g-iRfc (z-l)ffc(z) 
a-iffWRfe\»-l)ffcW 



= a-iR/(* - 1) - a-iG{i)rf (i)R, (37) 

where < a ^ 1 is the forgetting factor. The algorithm can be 
initiahzed with Rfe^(O) = SI and Ruj|(0) = SI, where S is 
a large positive number For the computation of the reduced- 
rank receiver parameter vector we use the matrix inversion 
lemma ||3ll to estimate {C^'D"R~^{i)I)Cky^ as given by: 



1 



1 



r.T\*-i)- 



. _1 ^^^^ 

where Tk{i) is an estimate of (CfD^Rj. {i)T>Ck) and 
= C^D^rfe(i) and then we construct the reduced-rank 
receiver as : 



Wfe(i) = Kkiiy 



^DC,r-i(z)g(z) 



(39) 



(36) 



The channel estimate g(i) is obtained through the power 
method and the RLS technique described in [28). Following 
this approach, the SVD on the Lp x Lp matrix C|^R~^(i)Cfe, 
as stated in (19) and that requires 0{Lp), is avoided and 
replaced by a single matrix-vector multiplication, resulting in 
the reduction of the corresponding computational complexity 
on one order of magnitude and no performance loss. In terms 
of computational complexity, the CMV-RLS algorithm with 
the interpolated receiver trades off one blind algorithm with 
complexity 0(A/^) against two with complexity 0(M-^/L^) 
and 0{Nf) operating in parallel. Since Nj is small as 
compared to M, it turns out that the new algorithms offer 
a significant computational advantage over conventional RLS 
algorithms. 



E. Computational Complexity 

In this section we illustrate the computational complexity 
of the proposed INT structure and algorithms. In Table 1 
we consider supervised algorithms, whereas the complexity 
of blind algorithms is depicted in Table 2. Specifically, we 
compare the full-rank, the proposed INT structure, the PD, 
the PC and the MWF structures with SG and RLS algorithms. 

In general, the INT structure introduces the term M /L, 
which can reduce the complexity by choosing the decimation 
factor L ^ 2. This is relevant for algorithms which have 
quadratic computational cost with M, i.e. the blind and trained 
RLS and the blind SG, because the decimation factor L in 
the denominator favors the proposed scheme which requires 
complexity 0{{M/ L)^). This complexity advantage is not 
verified with linear complexity recursions. For instance, with 
NLMS algorithms the proposed INT has a complexity slightly 
superior to the full-rank. Among the other methods, the PD 
is slightly more complex than the INT. A drawback of PC 
methods is that they require an SVD with associated cost 
0(M'^) in order to compute the desired subspace. Although 
the subspace of interest can be retrieved via computationally 
efficient tracking algorithms [T3l, (T^, these algorithms are 
still complex {0{M)^) and lead to performance degradation 
as compared to the actual SVD. The MWF technique has a 
complexity 0{DM^), where the variable dimension of the 
vectors M — M — d varies according to the orthogonal 
decomposition and the rank d— 1, . . . , D. 
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(a) Supervised Recursive Algorithms 



(b) Blind Recursive Algorithms 




INT(L.4) 
PC(D.K)-SVD 
PC(D=K)-Song S Roy 
PD{D=M/2 taps) 
PD{D-M/4 taps) 
MWF(D=4)-recursive 




- INT(L-2) 

- INT(L.3) 
INT(L.4) 

- PC(D.K)-Wang S Poor-SVD 

- PC{D-K)- Song & Roy 

- MWF(D=4)-recursive 
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Fig. 2. Complexity in terms of arithmetic operations versus number 
of received samples (M) for (a) supervised and (b) blind recursive 
adaptation algorithms. 



In order to illustrate the complexity trend in a compre- 
hensive way, we depict in Fig. 2 curves which describe the 
computational complexity in terms of the arithmetic operations 
(additions and multiplications) as a function of the number of 
parameters M for recursive algorithms. For these curves, we 
consider Lp = Q and assume that D is equal to M/2 for the 
eigen-decomposition approaches. We also include the compu- 
tational cost of the algorithm of Song and Roy |14|, which is 
capable of significantly reducing the cost required by SVD. 
In comparison with the existing reduced-rank techniques, the 
proposed INT scheme is significantly less complex than the 
PC and the MWF and slightly less complex than the PD. This 
is because the analyzed algorithms have quadratic cost (PC 
with SVD has cubic cost), whereas the INT has complexity 
0((Af/L)2), as shown in Tables I and II. 



TABLE I 

Computational complexity of supervised adaptation 
algorithms. 



Algorithm 

LMS-Full-rank 
LMS-INT 



LMS-PC 
LMS-PD 
MWF-SG 



Number of operations per symbol 



Additions 

m 

2M. + 2Ni 
+NiM + {f)Ni + 2 
0{M^) + 2D 
{D ~ 1)2 + 2D + 1 
D{2{M - i f +M + 3) 



Multiplications 

— 2M + i 



RLS-FuIl-rank 



RLS-INT 



RLS-PC 

RLS-PD 
MWF-Recursive 



3(M 



l7+ 



3f- + 2Ni 
+{f)Ni 
+ 2D + 1 
DI + 2D + 2 
D{2M'^ + 5M + 



TABLE II 

Computational complexity of blind adaptation 
algorithms. 



Algorithm 



Number of operations per symbol 
Additions 



Multiplications 



CMV-SG- 
FuU-rank 
CMV-SG-INT 

MWF-SG 



+ MLp + 2M + 1 

(ff + (f)ip 
+NiM + (f)Ni 

+2^+Ni+2 
D{2{M -if + 2) 



CMV-RLS- 
FuU-rank 

CMV-RLS-INT 



PC-Wang&Poor 
MWF-Recursive 



4(Af 
+3(Lp 



A-r + MLp + 3M 

+ f iV,+4f 
+Ni 

D{2M^ + 3M + 5) 



7M^ + M 



L; + 2Lp + MLp 

4(f -l)' + (f)' + 
+3{Lp-lf + 2fL 
+NiM + 3Lp-l 
+(f - l)Ni + {Ni ~ 



M'^ + 2{M - 
D{3{M - if 



If 



+MLp + ip + 4 

+Lp + —Lp 
+ 2 + Nf 



+Lj, 



+ Ni 



- 2M) 



0(M-^ 



2M^ + M 



D{3M^ + 2M + 3) 



V. GLOBAL Convergence Properties of the Method 
AND Convergence Analysis of Algorithms 

In this section we discuss the global convergence of the 
method and its properties, the trajectory of the mean tap 
vectors, of the excess mean square error and the convergence 
speed. Specifically, we study the convergence properties of 
the proposed joint adaptive interpolator and receiver scheme 
and conclude that it leads to an optimization problem with 
multiple global minima and no local minima. In this regard and 
based on the analyzed convergence properties of the method, 
it suffices to examine the MSB trajectory of only one of the 
jointly optimized parameter vectors (wfe or v^) in order to 
predict the excess MSB of both blind and supervised adaptive 
algorithms. We also provide a discussion of the speed of 
convergence of the INT as compared to the full-rank. 

A. Global Convergence of the Method and its Properties 

1) Interpolated MMSE Design: Let us first consider the 
trained receiver case and recall the associated MSB ex- 
pressions in (13) and (14), namely Jj\/s£;(vfc, Q:(vfc)) = 
^(vfc) = crl^ -pf (i)R^\i)pfc(i) and JMSB(/3(wfc), Wfc) = 



gI - p£(^)R-H^)p«.(^) where = E\\hk{i)\\ Note 



opt 



"Ttlat points of global minimum of Jj\/s£;(wfc(i), Vfc(i)) 

argminvfc J(vfc) and vfopt = a(vopt) or vfopt = 
argminwfc JAfS£(/3(wfc), Wfc) and Vopt = f3{^opt)- At a 
minimum point JMS_E(vfe, "(vfe)) equals Jj\/sB(/3(wfc), Wfc) 
and the MMSB for the proposed structure is achieved. We 
T^urther note that since J(vfc) = J{tVk), for every t ^ 0, 



M^ +2M 



6M^ 



3(4^ 



+ 3{Ni ~ 
■ - l)Ni 

M \2 



6(f 



6Nf 



+NiM + {f 



+Nf + 2f + 2Ni 
M^ + 3{D-lf 
+1)2 
4(D -if - 
D(4(Af - : 



+3f 



2D 

D'-' + 2D 
f + 2M) 



+ 2M + 2 then if v^, is a point of global minimum of J(vfc) then tv^, is 
also a point of global minimum. Therefore, points of global 
minimum (optimum interpolator filters) can be obtained by 
= argmin||vj.||=i J(vfc). Since the existence of at least one 
point of global minimum of J(vfc) for | |vfc 1 1 = 1 is guaranteed 
by the theorem of Weierstrass [32] (see Chapter 2, Section 2. 1, 
Appendix B), then the existence of (infinite) points of global 
3piinimum is also guaranteed for the cost function in (10). 
In the context of global convergence, a sufficient but not 
necessary condition is the convexity, which is verified if its 



+iV/ + 2 
0{M^) + 6D^ 

+2D + 2 
7D^ + 2D_+ 2 
D{4:M'^ + 2M + 



g 



Hessian matrix is positive semi-definite, that is a^Ha 0, 
for any vector a. Firstly, let us consider the minimization of 



vf(z)SR(z)w^ 



fixed interpolators. Such optimization leads to the following 



Hessian H 



with 



Rfe(«), 



which is positive semi-definite and ensures the convexity of 
the cost function for the case of fixed interpolators. Let us 
now consider the joint optimization of the interpolator and 
receiver through an equivalent cost function to (10): 



where B — 







(40) 

is an (iV/ + NjL) x {Ni + NjV) 



matrix, the Hessian (H) with respect to — [w^. vj, 
H = 4r^%f^ = i?[(zf Bzfe-6,)B^]-^i?[(zf B^z,- 
&*)B] +i;[Bzfczf B^] +£;[B^Zfezf B]. By examining H we 
note that the third and fourth terms yield positive semi-definite 
matrices (a^£;[Bzfezf B^]a ^ and a^£'[B^Zfczf B]a ^ 
0, Zfe 7^ 0) whereas the first and second terms are indefinite 
matrices. Thus, the cost function cannot be classified as 
convex. However, for a gradient search algorithm, a desirable 
property of the cost function is that it shows no points of 
local minimum, i.e. every point of minimum is a point of 
global minimum (convexity is a sufficient, but not necessary, 
condition for this property to hold) and it is conjectured that 
the problem in (40) has this property. 



(a) Surlace of J(w,v) = (1 -w'r'vf 



(b) Contour of J{w,v) = (1-w'r'v) 












global 

minimum points / 









ir Performance Surface of Interpolated IVIMSE Deslgr 
N=31. K = 6 users 



(d) Vanance Performance SuM^ce of Ir 
N=31 . K=6 




Fig. 3. (a) Error Performance Surface of the function /(?;, w) = (1 — 
w*r*vf' , (b) Contour plots showing that the function does not exhibit 
local minima and has multiple global minima, (c) Error performance 
surface of interpolated MMSE receivers at Eb/No = 15d-B for L = 3 
and(d) Variance performance surface of Jmv (v) for CMV receivers 
at Et/No — 15dB for L = 2 and channel with paths given by 0, 
—6 and —10 dB, spaced by Tc. 



To support this claim, we carried out the following studies: 
• Let us consider the scalar case of the function in (40), 
which is defined as f{w,v) = [h — w r vY = — 
2b w r v + {w ^ vY, where r is a constant. By choosing 
V (the "scalar" interpolator) fixed, it is evident that the 
resulting function /(w, v) = [h — w cY, where c is a 
constant is a convex one, whereas for a time-varying 



interpolator the curves shown in Fig. 3 (a) and (b), 
indicate that the function is no longer convex but it also 
does not exhibit local minima. 
• By taking into account that for small interpolator filter 
length Ni {Nj ^ 3), can be expressed in spherical 
coordinates and a surface can be constructed. Specifically, 
we expressed the parameter vector as follows: Vfc = 
r[cos{9)cos{(l)) cos{9)sin{(j)) sin{9)]'^, where r is the 
radius, 9 and (j> were varied from —tt/2 to tt/2 and — tt to 
TT, respectively, and (13) was plotted for various scenarios 
and conditions (SNR, different channels, etc). The plot of 
the error-performance surface of J{\'k), depicted in Fig. 
3 (c), reveals that J(va;) has a global minimum value 
(as it should) but do not exhibit local minima, which 
implies that (40) has no local minima either It should 
be noted that if the cost function in (40) had a point of 
local minimum then J(vfe) in (13) should also exhibit 
a point of local minimum even though the reciprocal is 
not necessarily true: a point of local minimum of J(vfc) 
may coiTespond to a saddle point of Jj\/s£;(vfc, w^), if 
it exists. Note also that the latitude X longitude plot in 
Fig. 3 (c) depicts its two symmetric global minima in the 
unit sphere. 

> An important feature that advocates the non-existence 
of local minima is that the algorithm always converge 
to the same minimum value, for a given experiment, 
independently of any interpolator initialization (except for 
v(0) = [0 ... 0]^ that eliminates the signal) for a wide 
range of SNR values and channels. 

2) Interpolated CMV Design: For the blind case, let us 
first consider the minimization of JMV'(wfc(i), Vfc(i)) = 



E 



|vf(i)5R(z)w*(» 



with fixed interpolators subject to 

C^D-^Wfc(i) = g(i) and ||vfc(i)|| = 1. It should be noted 
that global convergence of the CMV method has been estab- 
lished in [6 1 and here we treat a similar problem when fixed 
interpolators are used. Such optimization leads to the following 

£;[rfc(i)rf(i)] =Rfc(i), which 



Hessian H 



d (Jmv{-)} 



dv 



is positive semi-cfefinite and ensures the convexity of the cost 
function for the case of fixed interpolators. 

Consider the joint optimization of the interpolator Vk and 
receiver Wfe via an equivalent cost function to (10): 



Jmv(z) =£;[|zfBzfc|2] 
subject to C^D^Wfe(i) = g(i), where B = 



(41) 





^ R 

is an {Ni + N/L) x {Nj + N/L) matrix, 'the Hes 
sian (H) with respect to zj. = [w^ "^fe]"^ is H = 
= E[zjfBzkB"] + £;[zfB^ZfeB] -i 



d d{.JMSE{.)) 



Szfc 

i;fBzfczf B^] + £[B^Zfczf B]. By examining H we note 
that, as occurs for the MMSE case, the third and fourth terms 
yield positive semi-definite matrices (a^i?[BzfcZ^B^]a ^ 
and a^i?[B^Zfczf B]a ^ 0, z^ 7^ 0) whereas the first and 
second terms are indefinite matrices. Hence, the cost function 
cannot be classified as convex, although we conjecture that it 
does not exhibit local minima. Thus, we proceed similarly to 
the MMSE case to study the surfaces provided by the problem 
in (41). Then, we carried out the following studies: 

• We have also plotted the variance performance surface of 
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Joi'^k) in (17), depicted in Fig. 3 (d). This surface reveals 
that Joi'Vk) has a global minimum (as it should) but does 
not exhibit local minima, which implies that (41) subject 
to C^T>^-Wk{i) = g(*) has no local minima either 
• Another important feature that suggests the non-existence 
of local minima for the blind algorithms is that they 
always converge to the same minimum value, for a given 
experiment, independently of any interpolator initializa- 
tion (except for v(0) = [0 ... 0]"^ that eliminates the 
signal) for a wide range of parameters. 

B. Trajectory of the Mean Tap Vectors 

This part is devoted to the analysis of the trajectory of the 
mean tap vectors of the proposed structure when operating in 
blind and supervised modes. In our analysis, we employ the 
so called Independence Theory ([T|, ||3T1 (see Chapter 9, pp. 
390-404) that consists of four points, namely: 

1. The received vectors r(l), . . . , r(i) and their interpolated 
counterparts ffc(l), . . . ,ffc(i) constitute a sequence of statisti- 
cally independent vectors. 

2. At time i, r{i) and ffc(i) are statistically independent of 
6fc(l),...,6fc(i-l). 

3. At time i, bk{i) depends on r{i) and rfe(i), but is 
independent of previous bk{n), for n = 1, . . . , i — 1. 

4. The vectors r(i) and rfe(i) and the sample bk are mutually 
Gaussian-distributed random variables. 

In the present context, it is worth noting that the indepen- 
dence assumption holds for synchronous DS-CDMA systems 
lITl, which is the present case, but not for asynchronous 
models, even though it provides substantial insight. 

1) Trained algorithm: To proceed, let us drop the user k 
index for ease of presentation and define the tap error vectors 
e^(i) and e.„(i) at time index i 



e^{i) = w(i) - Wopf, e„(i) = v(i) - v, 



Opt 



(42) 



where Wopt and Vopt are the optimum tap vectors that achieve 
the MMSE for the proposed structure. Substituting the expres- 
sions in (42) into (21) and (22) we get 

e^i + 1) = [I - (i)f^(i)]e^(i) + ^ir{i)e*{i) (43) 

e„(z + 1) = [I - ?7u(^)u^(^)]e,(^) + 7^u{i)e*{i) (44) 

By taking expectations on both sides we have 

E[e^{i + 1)] = [I - ^R(^)]^;[e„(^)] + pE[r{i)e* (i)] (45) 

E[e,{i + 1)] = [I - 7]Ii^ii)]E[e,ii)] + 7]E[uii)e* (i)] (46) 

At this point, it should be noted that the two error vectors have 
to be considered together because of the joint optimization of 
the interpolator filter and the reduced-rank filter Rewriting 
the terms E[f{i)e*{i)] and E[u{i)e* (i)], using (42) and the 
independence theory f3T| (see Chap. 9, pp. 390-404) we obtain 

E[rit)e*{i)] = p(z) - E[rii)^^{z)Si"it)]E[e^it)]- 



E[r{i)wl,^*]E[e,,ii)] - £;[r(z)wjp,r 



^;[u(^)e*(^)] = Pu(*) - E[u{i)w^ {t)]E[e,{i)] 



Vopt J 

(47) 



E[uii)wJ„,U"]E[eUi)] - E[uii)wJ„,U*v. 



"optl 



By combining (45), (46), (47) and (48) the trajectory of the 
error vectors is given by: 



E[e^{i + l)] 
E[e„{i + 1)] 



= A 



E[eUi)] 
E[e,{i)] 



B 



(49) 



where 
A 

and 

B = 



■(I - imK) 



At£;[f(i)vr(i)sR^^(i)] 

T 
opt 



~/.i?[r(i)w^p,R*(i)] 
-»?Ru) -r;£;[u(i)w^(i)R*(i)l 



^lE[r{i)w^ tU*Vopt] 
VPu{i) - iiE[u{i)w^ptU*Vopt\ 



Equation (49) 



implies that the stability of the algorithms in the proposed 
structure depends on the matrix A. For stability, the 
convergence factors should be chosen so that the eigenvalues 
of A^A are less than one. 

2) Blind algorithm: The mean vector analysis of the blind 
algorithm is slightly different from [6 1 because our approach 
uses a decoupled SG channel estimation technique |28|, that 
yields better channel estimates. Hence, we consider the joint 
estimation of and v^, while g is a decoupled estimation 
process. To proceed, let us drop the user k index for ease of 
presentation and substitute the expressions of (42) into (32) 
and (34) that gives: 

e„,(i + 1) = [I ^ nr{i)r" {i)]e^{i) + DC(C^D^DC)-ig(i) 

(50) 

e„(i + 1) = [I - 77u(i)u^(i)]e„(i) - 7;u(«)vJ j5?*(i)e„(i) 



ryu(i)wjp43ft*(i)v, 



opt 



(51) 



where H = I - DC(C^D^DC)-iC^D^ and we used 
the fact that the scalars have alternative expressions as 



ieim''i^)v,p,y^ ^ iei,m"i^)^r,p,) ^ f,^,^*i^)e^i^) 
and (e^(i)5R*(i)wopf)^ = (e^(i)SR*(i)wopt) = 
wJpfSi^ {i)ey{i). By taking expectations on both sides 
and eliminating the term fiIlf{i)\-opt^*{i)'Wopt we get 

E[e^ii + 1)] = [I - tiR{i)]E[e^{i)] + DC(C^D^DC)-i£;[g(i)] 
- ^,UE[f{^)^l^U"{{)]E[e,{{)] 

(52) 

E[e,{t + 1)] = [I - TjR^{t)]E[e,,{i)] - TjE[u{i)v^ptU* {t)]E[e^{i)] 
-rjE[uii)w^p,^*it)]yr,pt 

(53) 

By combining (52) and (53) the trajectory of the error vectors 
for the minimum variance case is given by: 



E[eUi + l)] 
E[e,{i + 1)] 





E[e^{i)] 


— Amv 


Ele^i)] _ 



B 



MV 



(54) 



where A 



MV 



and B 



MV — 



[I-^l■R{^)] 

-vE[u{^)^l,U*{^)] 



-^,^E[ri^)wL,n"i^)] 



opt^ 



DC(C^D^DC)-l^;[g(^)] 

opt 



-r]E[u{i)vi'^pt?fi*{i)]wo 



Equation (54) 



(48) 



suggests that the stability of the algorithms in the proposed 
structure depends on the matrix Amv- For stability, the 
convergence factors should be chosen so that the eigenvalues 
of A^jy Amv are less than one. 
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C. Trajectory of Excess MSE 

Here we describe the trajectory of the excess MSE at steady- 
state of the trained and the bhnd SG algorithms. 

1) Trained Algorithm: The analysis for the LMS algorithm 
using the proposed interpolated structure and the computation 
of its steady-state excess MSE resembles the one in |31| (see 
Chapter 9, pp. 390-404). Here, an interpolated structure with 
joint optimization of interpolator and reduced-rank receiver 
Wfe is taken into account. Despite the joint optimization, for the 
computation of the excess MSE one has to consider only the 
reduced-rank parameter vector Wfe because the MSE attained 
upon convergence by (13) and (14) should be the same. 
Here, we will drop the user k index for ease of presentation. 
Consider the MSE at time i + 1 as: 



where we used the property of trace and Q^Q = I. Because 
A is a diagonal matrix of dimension AI / L we have 



M/L 

^exc{i + 1) = X! -^"^"(^ 
n=l 



1) 



(59) 



where x„ 



1,2, ... M/L are the elements of the diago- 



nal of X(j). Here, we may use (45), invoke the independence 
theory ||3T1 (see Chapter 9, pp. 390-404) in order to describe 
the correlation matrix of the weight error vector: 



K(i + 1) =£;[e„(^ + l)e^(^ + l)] 



(I - /xR(i))K(i)(I - mR(z)) + 



(60) 



e{i + 1) = E[\b{i + 1) - vf"{i + l)r{i + 1) 



(55) 



By using w(i + l) = Wopt + e„(i + l), Wopt, ^opt and the fact 
that the expressions in (13) and (14) are equal for the optimal 
parameter vectors, the MSE becomes 

eii + 1) = al - p"{i + l)-R-\i + l)p{i + 1) 

- p"{t + l)e^(z + 1) - {i + l)p{i + 1) 



Next, using the transformations Q^RQ = A, Q^KQ = X 
and similarly to ||3T1 (see Chapter 9, pp. 390-404), a recursive 
equation in terms of X(i) and A can be written: 

X(z + 1) = (I - M)X(z)(I - fiA) + /i2e„„„A (61) 

Because of the structure of the above equation, one can 
decouple the elements a;„(i) from the off-diagonal ones, and 
thus ^excii + 1) depends on according to the following 

recursion: 



wfptP(i + 1) + KpM^ + 



H 



'opt 



i(i + 1) = (1 - ^iXn^Xnii) + IJ^er, 



(62) 



+ wfp,R(i + l)e„(i + l)+eZ{i + l)R(i + l)vfopt 
+ E[e,,{i + 1)t{i + l)Y"{i + l)e^(^ + 1)] 
= (tI - p"{i + l)R-^(i + l)p{i + l)+ 
E[e,,{i + l)f (i + l)Y"{i + l)e,^ [i + 1)] 

— JMMSE{^opt,^opt) + £,exc{i + 1) 

(56) 

where p(i + 1) = E[h*{i + l)f(i + 1)], e,„i„ = 

JMMSE{^opt,^opt) = O-fc -p^(i + l)R"H* + l)p(* + l) is 

the MMSE achieved by the proposed structure when we have 
Wopf and Wopt and £,exc{i + 1) = ^'[e^ (« + 1)? (« + {i + 
l)ew{i + 1)] is the excess MSE at time i + 1. To compute 
the excess MSE one must evaluate the term £,exc{i + !)• By 
invoking the independence assumption and the properties of 
trace 131] (see Chap. 9, pp. 390-404) we may reduce it as 
follows: 



At this point, it can be noted that such recursive relation 
converges provided that all the roots lie inside the unit circle, 
i. e., (1 — fiXn)^ < 1 for all n, and thus we have for stabihty 



< H < 



An 



(63) 



where Xmax is the largest eigenvalue of the matrix R. In 
practice, tr[R] is used as a conservative estimate of Xmax- 
By taking limi^.oo on both sides of (62), we get Xn{oo) = 
^min- Then, taking limits on both sides of (59) and 
using Xn{oo) we obtain the expression for the excess MSE at 
steady-state 



M/L M/L 

^excico) = ^ A„a;„(cx)) = 



^ 2 

n=l 



flXr. 



E[ei^{i+l)f{i+l)f"{i + l)eUi + l)]=tr R(i + l)K(i+l) 

(57) 

In the following steps, we assume that i is sufficiently large 
such that the matrix R(i) ~ R(oo) ~ R. To proceed let 
us define some new quantities that will perform a rotation 
of coordinates to facilitate our analysis as advocated in ||3T1 . 
Define Q^RQ = A, where A is a diagonal matrix consisting 
of the eigenvalues of R and Q is the unitary matrix with 
the eigenvectors associated with these eigenvalues. Letting 
Q^KQ = X we get 



^exc{i + 1) =tr 
= tr 



RK(i + 1) 



tr 



QAQ^QX(z + l)Q^ 



QAX(^ + 1)Q 



H 



= tr 



AX(i + 1] 



(58) 



_ f tr[R] 
" " 1-f tr[R]' 
(64) 

The expression in (64) can be used to predict semi-analytically 
the excess MSE, where R must be estimated with the aid of 
computer simulations since it is a function of the interpolator 
v(i). Alternatively, one can conduct the analysis for the 
interpolator v(i), which results in the expression ^exc(oo) = 

n. tr[R 1 

tr[Ti ] ^rnin, whcrc Tj is the Step size of the interpolator, 
the matrix R^ = R„(oo) and Tlu{i) — E[\i{i)vL^ [i]], as 
defined in connection with (12). A more complete analytical 
result, expressed as a function of both step sizes, ji and r/, 
and statistics of the non-interpolated observation vector r(i) 
requires further investigation in order to determine ir[R(oo)], 
that depends on rj or tr[Ru(oo)], that depends on ^x. Never- 
theless, such investigation is beyond the scope of this paper 
and it should be remarked that the results would not differ 
from the semi-analytical results derived here (that implicitly 
take into account the parameters of v(z)). 



11 



2) Blind Algorithm: Our algorithm is a minimum variance 
technique and its steady-state excess MSE resembles the 
approach in |6|. In the current context, however, an interpo- 
lated structure with joint optimization of interpolator Vfc and 
reduced-rank receiver is taken into account. In particular, 
it suffices to consider for the computation of the excess MSE 
only the reduced-rank parameter vector because the MSE 
attained upon convergence by the recursions, that work in 
parallel, for Wfe and Vfe should be the same. Here, we will 
drop the user k index for ease of presentation. Consider the 
MSE at time i + 1 as: 



e{i + 1) = E[\\b{i + 1) - w"{i + l)r{i + 1) 



(65) 



By using w{i + 1) = Wopt + e„, (i + 1) and the independence 
assumption the MSE becomes 



e{i + 1) - E[b{t + l)f^(i + l)]e^(i + 1) 

-e^{i + l)E[b*{i + l)r{i + l)] 
+ w^(R(i + l)e„(i + l) 

+ e^{i + l)R(i + l)Wopt + ^exc{i + 1) 



(66) 



where e™„ at -_E[b{i + l)v"{i + l)]\Vopt - ^^ptE[b*{i + 
l)r(i + 1)] + w^iR(i + l)wop4 is the MSE with the optimal 
reduced-rank receiver Wopt and the optimal interpolator Vopt 
and ^eic(* + l) = E[e^{i + l)'R.{i + l)e^{i + l)] is the excess 
MSE at time i + 1. Since limi^oo E[e^{i)] = 0, we have 



lim e{i + 1) 



+ lim Cexcii + 1) 



(67) 



Note that the second term in (67) is the steady-state excess 
MSE due to adaptation, denoted by ^^xc and which is related 
to w by 



^exc(oo) = lim trE[Iie^{i + l)e^(i + 1) 



(68) 



Let us define Re(i) = £'[et„(i)e^(i)]. Re = limi^ooRe(*) 
and use the property of trace to obtain 



^e:rc(oo) = tr£;[RRe] = vec" {TCjveciKe) 



(69) 



At this point it can be noted that to assess ^e2;c(oo) it is 
sufficient to study Re, that depends on the trajectory of the 
tap error vector For simplicity and similarly to [6 we assume 
that eg{i) w C^D^etu(i), which is valid as the adaptation 
approaches steady state. Using the expression of e^(i + 1), 
and taking expectation on both sides of e^{i + l)e^ {i + 1), 
the resulting matrix Re(i + 1) becomes 

Re{i + 1) w Re(i) - fi{Re{i)'R.{i)U + nRRe(i)) 

+ nf{i)f"{i)wopte^(i)n] (70) 
+ fi^E[Ufit)r"ii)iwopt^gt 
+ Re(i))f(i)f^(i)n] 

where U = 1 T>C{C"T>"T>C)^C"T>" . Since 

limi^ooRe(* + 1) = R-e and liuii^oo E[eii,{i) — 0, taking 
limits on both sides of (70) yields 



ReRn + nRRe 



^iE[Ur{t)f" {{){woptw" 

+ Re(l))f(l)f^(l)n] 



opt 



(71) 



Here an expression for fexc(oo) can be obtained by using 
the properties of the Kronecker product and arranging all 
elements of a matrix into a vector column-wise through the vec 
operation. Hence, the expression for the steady-state excess 
MSE becomes: 



^exc(oo) = tr RRe = fi wec^ (R)T~^a 



(72) 



where T = (RH) 



n]E 



I + I 



(HR) 



a = 



wec(wopfW^j) and 



(r(^)r(^)^)^ ® {r{i)r{i)") 

n ^;"(r(^)r^(^))^ (g) (r(i)r^(i)) 
(X) accounts for the Kronecker product. The expression in 
(72) can be used to predict semi-analytically the excess MSE, 
where the matrices R, T and the vector a are computed 
through simulations. 

D. Transient Analysis and Convergence Speed 

With regard to convergence speed, adaptive receivers/filters 
have a performance which is proportional to the number of 
adaptive elements M ([T], |fT9l . ifSTl . Assuming stationary 
noise and interference, full-rank schemes with RLS algorithms 
take 2M iterations to converge, while SG algorithms require 
at least an order of magnitude more iterations than RLS 
techniques |31|. In addition, it is expected that RLS methods 
do not show excess MSE (when a = 1 and operating in a 
stationary environment) and its convergence is independent of 
the eigenvalues of the input correlation matrix. 

With the proposed INT reduced-rank scheme, the conver- 
gence can be made faster due to the reduced-number of filter 
coefficients and the decimation factor L can be varied in order 
to control the speed and ability of the filter to track changing 
environments. In Appendices II and III, we mathematically 
explain how the INT structure can obtain gains in convergence 
speed over full-rank schemes with SG and RLS algorithms, 
respectively. 

For SG algorithms, the analysis of the transient components 
in Appendix II of the INT scheme reveals that the speed of 
convergence depends on the eigenvalue spread of the reduced- 
rank covariance matrix. In principle, we cannot mathematically 
guarantee that the INT always converges faster than the full- 
rank but several studies that examine the eigenvalue spread 
of the full-rank and the INT covariance matrix show that, for 
the same data, the INT structure is able to consistently reduce 
the eigenvalue spread found in the original data covariance 
matrix, thus explaining its faster convergence in all analyzed 
scenarios. 

For RLS techniques, the analysis of the transient compo- 
nents in Appendix III guarantees mathematically that the INT 
is able to converge faster due to the reduced number of filter 
elements and we show that the INT with the RLS converges 
in about 2M/L iterations as compared to the full-rank that 
requires 2AI iterations. 

VI. Simulations 

In this section we investigate the effectiveness of the pro- 
posed linear receiver structure and algorithms via simulations 
and verify the validity of the convergence analysis undertaken 
for predicting the MSE obtained by the adaptive algorithms. 
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We have conducted experiments under stationary and non- 
stationary scenarios to assess the convergence performance in 
terms of signal-to-interference-plus-noise ratio (SINR) of the 
proposed structure and algorithms and compared them with 
other recently reported techniques, namely adaptive versions 
of the MMSE 1 19| and CMV |6| full-rank methods, the eigen- 
decomposition (PC) |12|, |13| , the partial despreading (PD) 
Il5\ and the multi-stage Wiener filter (MWF) |18| reduced- 
rank techniques with rank D. Moreover, the bit error rate 
(BER) performance of the receivers employing the analyzed 
techniques is assessed for different loads, processing gains 
(N), channel paths (Lp) and profiles, and fading rates. The 
DS-CDMA system employs Gold sequences of length = 31 
and N = 63. 

Because we focus on the downlink users experiment the 
same channel conditions. All channels assume that = 6 as 
an upper bound (even though the effective number of paths 
will be indicated in the experiments). For fading channels, 
the channel coefficients hi{i) = piai{i) (l — 0,1,2), where 

^f=iPi = 1 ^'^'1 Q;;(*)' is a complex unit variance Gaus- 
sian random sequence obtained by passing complex white 
Gaussian noise through a filter with approximate transfer 
function c/ -^/l — {f / fdY where c is a normalization con- 
stant, fd ~ v/Xc is the maximum Doppler shift, Ac is the 
wavelength of the carrier frequency, and v is the speed of 
the mobile Ii33i . This procedure corresponds to the generation 
of independent sequences of correlated unit power Rayleigh 
random variables (£'[|af(i)|] = 1). The phase ambiguity 
derived from the blind channel estimation method in [28) is 
eliminated in our simulations by using the phase of g(0) as a 
reference to remove the ambiguity and for fading channels we 
assume ideal phase tracking and express the results in terms 
of the normalized Doppler frequency fdT (cycles/symbol). 
Alternatively, differential modulation can be used to account 
for the phase rotation. For the proposed interpolated receivers 
structures we employ M = {N + Lp — 1)/L adaptive elements 
for L — 2,3,4,8, and when M is not an integer we will 
approximate it to the nearest integer. For the full -rank receiver 
we have M ^ {N + Lp - 1). 

In the following experiments, it is indicated the type of 
adaptive algorithms used and their mode of operation, i.e. 
training mode, decision-directed mode and blind mode. For 
the training-based algorithms, the receiver employs training 
sequences with Ntr symbols and then switch to decision- 
directed mode. The full-rank receiver is considered with the 
NLMS and RLS tecnhiques, the interpolated receivers are 
denoted INT, the PC method OZl requires an SVD on the 
full-rank covariance matrix and the subspace dimension is 
chosen as D = K. For the PD approach, the columns of 
the projection matrix are non-overlapping segments of s^, 
as described in |15|, whereas for the MWF and its SG and 
recursive adaptive versions (MWF-SG and MWF-rec) [IS] the 
number of stages D is optimized for each scenario. The RAKE 
receiver in supervised mode uses the NLMS and the RLS 
techniques and the training sequence in order to estimate its 
parameters. With respect to blind algorithms and the full-rank 
receiver, the SG algorithm corresponds to the one in [6l with 
a normalized step size similar to the one introduced in Section 
IV. A and the RLS corresponds to the one reported in [6 |. The 



proposed interpolated receiver, i.e. the INT, uses the CMV- 
SG and CMV-RLS algorithms especially designed for it. The 
different receiver techniques, algorithms, processing gain N, 
the decimation factor L and other parameters are depicted on 
the legends. The eigen-decomposition based receiver of Wang 
and Poor [13] is denoted Subspace-W & P and employs an 
SVD to compute its eigenvectors and eigenvalues. With regard 
to blind channel estimation we employ the method in |28| for 
all SG based receivers, while for the RLS based receivers 
we adopt |28|. The blind MWF and its adaptive versions 
(blind MWF-SG and bUnd MWF-rec) 118| have their rank 
D optimized for each situation and employ the blind channel 
estimation in f28l to obtain the effective signature sequence 
in multipath. For the RAKE receiver [33] , we also employ the 
SG blind channel estimation of 1281 when compared to other 
SG based multiuser receivers, whereas for the comparison with 
RLS based receivers we use its RLS version [i28i. 



A. MSB Convergence Performance: Analytical Results 

Here, we verify that the results (64) and (72) of the sec- 
tion on convergence analysis of the mechanisms can provide 
a means of estimating the excess MSE. The steady state 
MSE between the desired and the estimated symbol obtained 
through simulation is compared with the steady-state MSE 
computed via the expressions derived in Section VI. In order 
to illustrate the usefulness of our analysis we have carried out 
some experiments. The interpolator filters were designed with 
Nj — 3 elements and the channels have 3 paths with gains 
0, —6 e —10 dB, respectively, where in each run the delay of 
the second path (T2) is given by a discrete uniform random 
variable (r v.) between 1 and 4 chips and the third path is 
computed with a discrete uniform r v. between 1 and (5 — 
chips in a scenario with perfect power control. 




500 1000 
Number of received symbols 



Number of received symbols 



Fig. 4. MSE convergence for analytical and simulated results versus 
number of received symbols using (a) trained LMS algorithms and 
(b) blind SG algorithms. 



In the first experiment, we have considered the LMS al- 
gorithm in trained mode and tuned the parameters of the 
mechanisms, in order to achieve a low steady-state MSE upon 
convergence. The parameters of convergence, i.e. /i, are 0.05, 
0.06, 0.075 and 0.09 for the full-rank and the INT with L=2,3 
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and 4, respectively, and 77 = 0.005 for the interpolator with 
all L. The results are shown in Fig. 4 (a), and indicate that 
the analytical results closely match those obtained through 
simulation upon convergence, verifying the validity of our 
analysis. 

In the second experiment, we have considered the blind 
SG algorithm and tuned the parameters of the mechanisms, in 
order to achieve a low steady-state MSB upon convergence, 
similarly to the LMS case. The chosen values for /i are 
0.0009, 0.001, 0.0025 and 0.004 for the full-rank and the 
INT with L=2,3 and 4, respectively, and 77 — 0.005 for 
the interpolator with all L. The curves, depicted in Fig. 4 
(b), reveal that a discrepancy is verified in the beginning 
of the convergence process, when the estimated covariance 
matrix is constructed with few samples. Also, this mismatch 
between the theoretical and simulated curves is explained by 
the fact that blind algorithms are more noisy than trained 
techniques Q. However, as time goes by and the data record 
is augmented, the statistics of the signals is acquired and 
the modelled and simulated MSB curves come to a greater 
agreement. 

B. SINR Convergence Performance 

The SINR at the receiver end is used here to assess 
the convergence performance of the analysed methods. In 
the following experiments we will assess the SINR perfor- 
mance of the analyzed adaptive receiver techniques and their 
corresponding algorithms, namely, the proposed interpolated 
receiver, the PC, the PD, the MWF and the RAKB. We remark 
that the parameters of the algorithms have been tuned in order 
to optimize performance and the receiver parameters have been 
carefully chosen to provide a fair comparison amongst the 
analyzed methods. 



(a) NLMS, N-31, K=4 users 



(b) RLS, N-63, K-8 users, f T=0.005 



(a) RLS, N.63, K = 12 



(b) RLS, N-63, K-15 users 




- Full-rank 
INT(L-3)-3 fixed taps 

- INT{L-3)-3 adap. taps 

- INT{L-3}-4 adap. taps 

- INT{L-3)-5 adap. taps 

- INT{L-3)-6 adap. taps 




- Full-rank 
INT{L-4)-3 fixed taps 

- INT{L=4)-3 adap. taps 

- INT{L=4)-4 adap. taps 
INT{L-4)-5 adap. taps 

- INT{L-4)-6 adap. taps 
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Fig. 5. Design of interpolator filters to obtain the best dimensions 
for Ni with random 3-path channel parameters (r. v. between —1 
and 1) as given in Section VI.A and the scenario has equal power 
users, (a) trained RLS type algorithms at Eb/No = 12 dB (b) blind 
CMV-RLS type algorithms at Eb/No = 15 dB. 



Firstly, let us consider the issue of how long should be the 
interpolator filter. Indeed, the design of the interpolator filter 




Full-rank 
INT(L.2)-fix. int. 
INT(L.2) 
INT(L.3) 
INT(L=4) 
PC(D=K) 
PD-18taps 
PD-9 1! 
MWF(D=5)-SG 
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Fig. 6. SINR performance of the receivers with (a) NLMS, Eb/No = 
8 dB and three interferers with power levels 7 dB above the desired 
user and channel parameters po — 1, p2 = 0.5 and p4 = 0.3 (spaced 
by 2Tc) and (b) RLS, Et/No = 12 dB and three interferers with 
power levels 10 dB above the desired user with fading and channel 
parameters po = 1, P2 = 0.7 and p4 — 0.5 (spaced by 2Tc). 



is a fundamental issue in our approach because it affects its 
convergence and BBR performance. In order to obtain the most 
adequate dimension for the interpolator filter v^, we conducted 
experiments with values ranging from iV/ = 3 to Nj = 6, as 
the ones shown in Fig. 5 for the supervised and blind modes 
with the RLS, respectively. The results indicate that SINR 
performance was not sensitive to an increase in the number of 
taps in Vfc and the best results for all algorithms were obtained 
with Nj = 3. For this reason and to keep the complexity low 
we selected Nj = 3 for the remaining experiments. We also 
remark that the simulation aided design of the interpolator 
dimension was carried out for systems with different N, K, 
L, channel profiles and fading rates, indicating that Nj = 3 
is a satisfactory dimension. The SINR convergence curves 
show that the proposed structure with adaptive interpolators 
is considerably superior to the fixed interpolator approach and 
to the full-rank receiver 

Fig. 6 illustrates experiments where the INT is compared 
to other reduced-rank techniques in training and decision- 
directed modes. In both experiments, a training sequence is 
provided to the receivers with 200 symbols, and then the 
algorithms switch to decision-directed mode. The parameters 
of the receivers for all methods were optimized and the results 
show that the proposed structure with adaptive interpolators 
and L = 2 achieves the best performance and is significantly 
superior to the INT with a fixed interpolator The convergence 
performance of the INT for various L is superior to the full- 
rank one and to the PC and PD methods. The PC method 
performs well when K is small but it is outperformed, both in 
terms of convergence speed and final SINR, by the INT with 
L = 2,3. The INT with L = 3 and L = 4 are also superior to 
the PD method with 18 and 9 elements, whereas the INT with 
i = 4 has a performance comparable with the MWF adaptive 
versions. 

In Fig. 7 the SINR performance of the analyzed receivers 
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(a) RLS, N.63, K=1 6 users, f J.0.001 (b) RLS, N.63, E /N .1 5 dB, f T.0.001 
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Fig. 7. SINR performance of blind (a) SG algorithms with channel 
parameters po — 1, P2 ~ 0.5 and p4 — 0.5 (spaced by 2Tc) where 
2 interferers work at a power level 7 dB above the desired user that 
operates at Et/No = 15 dB with fading (b) RLS algorithm with 
Eb/No — 15 dB without fading, the 3-path channel parameters are 
random as in Section VI.A and the received powers of the interferers 
are log-normal r. v. with associated standard deviation 3 dB. 



is examined in blind mode. The parameters of the receivers 
for all methods were optimized and the results show that the 
proposed structure with adaptive interpolators and L ~ 2 
achieves the best performance. The convergence performance 
of the novel structure for various L is superior to the full-rank 
one and to the other methods. Note that subspace approach 
of Wang and Poor performs very well for small K but when 
K is larger its performance degrades considerably. The INT 
shows very good performance in all situations and requires 
lower computational costs than the other techniques. 

C. BER Performance 

In this section, the BER performance of the different re- 
ceiver techniques is investigated. In Fig. 8, the BER curves 
for the RLS algorithms in trained and decision-directed modes 
are shown. The channel parameters are po = 1, Pi = 0.7 and 
P2 — 0.5, where in each run the delay of the second path (T2) 
is given by a discrete uniform r v. between 1 and 4 chips and 
the third path is computed with a discrete uniform r v. between 
1 and (5 — T2) chips. In these experiments the received powers 
of the interferers are log-normal r. v. with associated standard 
deviation 3 dB. We remark that the proposed methods also 
perform well with other channel profiles and fading rates. The 
receivers are trained with 200 symbols, are then switched to 
decision-directed mode and process 2000 symbols, averaged 
over 200 experiments with optimized parameters for each 
scenario. The results show that the INT with L — 2 achieves 
the best performance, followed by the full-rank receiver, the 
INT with i = 3, the MWE the PD approach, the INT with 
i = 4, the PC and the RAKE receiver 

In Fig. 9, the BER curves for the RLS-type algorithms in 
blind mode, respectively, are shown. The receivers process 
2000 symbols, averaged over 200 experiments with optimized 
parameters for each scenario. In these simulations the received 



Fig. 8. BER performance of trained RLS algorithms versus (a) 
Eb/No (b) number of users. 



powers of the interferers are log-normal r. v. with associated 
standard deviation 3 dB. The results show that the INT with 
L = 2 achieves the best performance, followed by the full- 
rank receiver, the INT with L = 3, the MWF, the INT with 
L = 4, the subspace receiver of Wang and Poor and the RAKE 
receiver Note that the receivers can accommodate more users 
and cope with larger systems when working with RLS type 
algorithms and that the INT structure with L — A outperforms 
the RAKE and Wang and Poor's (for if ^ 8) receivers, the 
INT with L = 2 outperforms the full-rank receiver and the 
INT with L = 3 has a very close performance to the full- 
rank. The blind MWF versions are slightly inferior to the INT 
with L = 3 and suffer from the fact that tri-diagonalization 
does not occur, deteriorating its performance. 
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Fig. 9. BER performance of blind RLS algorithms versus (a) Eb/No 
(b) number of users. 
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VII. Conclusions 

We proposed adaptive reduce-rank receivers for DS-CDMA 
based on interpolated FIR filters with adaptive interpolators. 
The novel receiver structure and algorithms were assessed 
in various scenarios, outperforming previously reported tech- 
niques with a very attractive trade-off between performance 
and complexity. An analysis of the convergence properties 
of the method was undertaken, indicating that the novel 
cost function does not exhibit local minima. Furthermore, 
a convergence analysis of the algorithms was shown to be 
valid in predicting the excess MSB upon convergence for the 
blind and trained SG algorithms. In terms of computational 
complexity, the AIFIR receivers are simpler than the full-rank 
receiver, much simpler than reduced-rank eigen-decomposition 
techniques and competes favorably with the MWF The BER 
performance of the interpolated receivers is superior to the 
subspace receiver, the MWF and close to the full-rank one 
even with L — A. Finally, with respect to convergence, 
the proposed receivers exhibit a faster response and greater 
flexibility than other analyzed methods since the designer can 
choose the decimation factor depending on the need for faster 
convergence or higher steady-state performance. 



to Ell, O (see Chapter 9, pp. 390-404 and Appendix J, 
pp. 924-927 with the solution to differential equations), let us 
express the excess MSB in (59) as a function of its transient 
and steady-state components: 



M/L 

E 

n=l 



M/L 



ji=i 



x(oo)] + ^exc{oo) 



^,:A«g„gf[x(o) 

(77) 



where c„ is the nth eigenvalue of an M/L x M/L matrix T 
whose entries are 



(1 - mA„)2 



j 



(78) 



According to the above equation the speed of convergence of 
the proposed INT structure for SG algorithms is given by the 
transient component Ctransii) = En^i" c,;A^g„g^[x(0) - 
x(oo)] which can be alternatively expressed by: 



S) 



M/L 



(79) 



Appendix 

Let R be a positive semi-definite Hermitian symmetric 
matrix and its eigenvalues be ordered as Xmax = Ai > 



A. ^ 



7? A 



N- 



-i > \n = Xmin ^ with corresponding 
eigenvectors q„i (to = 1,2,...,A^). Consider an initial 
vector v(0) = X]m=i CrnQm, whcrc c,n are scalars with 
ci — v^(0)qi 7^ 0. Using the power iterations we have 

v(i) = Rv(i - 1) = R'v(O) = ciA^qi + caA^qa + . . . + cn 



= ciAKqi + C2/ci(A2/Ai)''q2 

(73) 



where 7„ — A''^g„g^[x(0) — x(c)o)]. Note that the transient 
component ^transii) ~> as i — > oo. By using the exist- 
ing expression for the transient component of the full-rank 
receiver described by Ct™n7''''"'°(*) = E^lii 7n4 Oil, we 
can establish conditions for which the transient component of 
the INT receiver defined in (79) can vanish faster, i.e., the 
INT scheme converges faster If the INT scheme reduces the 
y eigenvalue spread of its covariance matrix, we have for the zth 
^i^eration 



CN / ci{Xn / Xiftik 



If we normalize the above equation we obtain 

(qi + C2/ci(A2/Ai)''q2 + . . . + cjv/ci(AAr/Ai)*qjv 



n=l 



M 



(80) 



v(0 

|v(i)|| ||(qi +C2/ci(A2/Ai)'^q2 



and since ^ -r^ < 1, 



, . + CAr/ci(AAr/Ai)*qjv 
(74) 

for 2 ^ m ^ iV it results that 



The above condition states that the transient component of the 
duced-rank INT scheme has fewer decreasing modes and 



lim,: 



(A^y = 0, for 2 ^ TO ^ TV. Thus, we conclude fliat 



lim 



v(z) 



z^R, where v 



(75) 

l/tr[R] and 
1, . . . ,iV. Since 



|v(z) 

Now, let us make A = I 
whose eigenvalues are A' = 1 , 

tr[R] = Y.m=i '^rn ^ Ai = Krax, it rcsults that ^ A'l ^ 
A2 ^ . . . ^ X'n^i < A^. Therefore, from the development in 
(73)-(75), we have that the recursion v(j) — (I— zyR)v(i— 1), 
i = 1, 2, . . . results in 

v(*) 



'anishes before that of the full-rank structure. To verify (80), 
we studied the eigenvalue spread of the covariance matrices 
of the INT and the full-rank schemes in an extensive set 
of scenarios. In all situations, the experiments indicate an 
increase in the convergence speed and also that the INT can 
reduce the eigenvalue spread of the full-rank scheme. 

Here we evaluate the convergence speed of the proposed 
INT receiver scheme through the MSB analysis of RLS 
algorithms. By using a similar analysis to [i3 1 i (see Chapter 13, 
pp. 573—579) and replacing the expectation operator with time 
averages, let us express weight error vector of the reduced-rank 
INT least squares solution: 



lim 



(76) 



w(?;) 



'opt 



R 



{^)J2r{l)e:{l) (81) 



where q^ is the normalized eigenvector of A associated to 
1 ~ tTTt^' *at is (I - J/R)q'^ = (1 - 



\' _ \' _ 

^max ~ ~ ^ tr[Ru^^ 

i^XminWN and hence q^ = 

In this appendix we assess the convergence speed of the 
proposed INT receiver scheme through the transient compo- 
nent analysis of SG algorithms. By using a similar analysis 



Using the definition for the weight error correlation matrix 
K(i) - S[e^(i)e^(i)] m we have: 



K{i) = E 



R (z)^^r(Oe:(OeoO>^(j)R" i^) 

1=1 j=i 



(82) 
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Assuming that eo{i) is taken from a zero mean Gaus- 
sian process with variance cr^, we have E[eo{l)el{j)] ~ 



0, l^J 
K(i) = a^E 



and 

1=1 j=i 



R (z) 
(83) 



By invoking the independence theory and using the fact that 

the estimate of the covariance matrix given by R (i) is 
described by a complex Wishart distribution liSTj (see Section 

13.6), the expectation of the time averaged estimate R [i) 
is exactly 



E[fC'{i)] = 



1 



-R \ 



> M/L + 1 (84) 



i - M/L - 1 " 

where R~^ is the theoretical reduced-rank covariance matrix 
and thus 

a^R-i 



K(z) 



i > M/L + 1 



(85) 



i - M/L - 1 ' 

Using the expression that describes the excess MSE in (58) 
we get 



Cexc{i) = tr RK(i 



(T^M/L 
i - M/L - 1 



i > M/L + 1 
(86) 

The above result shows that the learning curve of the RLS 
algorithm with the proposed reduced-rank structure converges 
in about 2M/L iterations, in contrast to the RLS with the 
full-rank scheme, that requires about 2M iterations [31 1. This 
means that the proposed scheme converges L times faster 
than the full-rank approach with RLS techniques. Another 
observation from (86) is that as i increases the excess MSE 
tends to zero (for A = 1) and it is independent from the 

eigenvalue spread of R (i). 
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